Depopulation is a major problem in rural areas of the world. The main aim of this work is the construction of a Spatial Depopulation Risk Index (sDRI) for the 919 municipalities of Castilla-La Mancha, using geostatistical techniques and principal component analysis. The theoretical semivariogram reveals spatial dependence up to a distance of 60 kilometers. Based on this range a neighbourhood network is constructed. Then a spatial principal component analysis (sPCA) is applied to a set of demographic variables. Finally, the sDRI is designed by extracting and scaling the first principal component of the sPCA. The resulting indicator identifies the areas with depopulation risk in which counter-measures can be applied.
Did you know that some areas of Cuenca y Guadalajara have a lower population density than Siberia? Depopulation is a major problem in rural areas of Castilla-La Mancha.
Table 1 shows that 445 municipalities of the region lost more than 20% of their population, whereas only 237 municipalities improved it in the last two decades (2001-2020).
| Population growth rate | Number of Municipalities |
|---|---|
| loss >20% | 445 |
| loss 10-20% | 131 |
| loss 5-10% | 62 |
| loss <5% | 44 |
| gain <5% | 43 |
| gain 5-20% | 67 |
| gain >20% | 127 |
As stated in the First Law of Geography, “Everything is related to everything else, but near things are more related than distant things” (Tobler, 1970), and since depopulation is a variable with spatial dependence, to carry out our purpose we deal with geostatistics and machine learning tecniques. Figure 1 stands for the methodology used in this work:
Figure 1: Methodology
The main aim of this study, the construction of a sDRI, is achived with the following esteps:
The instrument used par excellence to \(\color{#FFC000}{\text{detect the spatial dependence}}\) in a regionalized variable (Montero et al., 2015) is the semivariogram. Its expression is given by: \[\begin{equation} \tag{1} \gamma(s_i-s_j) = \frac{1}{2}V((s_i)-Z(s_j)), \forall s_i,s_j\in D \end{equation}\] where \(s_i\) and \(s_j\) are two locations (municipalities) in the domain \(D\), \(V\) is the variance, and \(Z(s)\) is the regionalized variable (Population growth rate) at location (municipality) \(s\).
\(\color{#FFC000}{\text{The range of spatial dependence }}\) is extracted from the semivariogram (see Figure 1).
Figure 2: Components of a semivariogram
Based on the range, \(\color{#453780}{\text{a neighbourhood network}}\) is constructed in the form of a proximity matrix \(L\).
Two types of spatial patterns are discriminated: global and local structures, corresponding respectively to large positive and large negative eigenvalues. This is accomplished by maximizing: \[\begin{equation} \tag{2} C(v) = V(Xv)I(Xv) = \frac{1}{n}(Xv)^TLXv = \frac{1}{n}v^TX^TLXv \end{equation}\] where \(V\) is the variance, \(X\) the demographic data matrix, \(I()\) the Moran’s \(I\), which catch the spatial autocorrelation, \(L\) the proximity matrix and \(v\) the scaled axes in \(R^{10}\), with \(||v||^2 = 1\). Figure 2 shows the extreme theoretical possibilities.
\(\color{#288A8C}{\text{The extraction of the first principal component}}\) is carry out.
The sDRI for each municipality is obtained \(\color{#288A8C}{\text{scaling from 0 to 100}}\) the principal component.
Figure 3: Theoretical cases: (a) spatial dependence, (b) no spatial dependence
The first step is the estimation of a semivariogram to analyze the spatial dependence and calculate its range. As shown in Figure 4 the semivariogram is adjusted to a spherical model with the following parameters: range of 60000 meters (60 km), sill of 3419, and nugget of 1667.
Figure 4: Adjusted semivariogram
Once the range of spatial dependence is estimated to 60 km, the neighbourhood network is constructed and the spatial analysis of principal components of depopulation in Castilla-La Mancha is performed. The results of the sPCA are shown in Figure 5.
Figure 5: Principal results of spatial principal component analysis: (a) Eigenvalues of sPCA; (b) Map of sPCA scores of municipalities.
The last step of the work is to extract the first component of sPCA and scale it from 0 to 100. The resulting sDRI is used to classify the municipalities from lack of depopulation risk (sDRI = 0) to extreme risk (sDRI = 100). As shown in Figure 6-left Albacete is the municipality with absolute abscence of depopulation risk (sDRI = 0), followed by Guadalajara, Talavera de la Reina, Toledo and Azuqueca de Henares. In the opposite, we have Arandilla del Arroyo (sDRI = 100), followed by Alique, Valsalobre, Angón and Pineda de Cigüela. Figure 6-right represents the sDRI in a map of municipalities of Castilla-La Mancha.
Figure 6: Depopulation Risk in municipalities of Castilla-La Mancha according to sDRI
The applied spatial principal component analysis results in a Depopulation Risk Index which identifies numerous areas as having a medium to high risk of depopulation; namely, the majority of villages of Cuenca and Guadalajara, and the west and the south of the region. Conversely, it shows no risk for the areas of La Mancha and the Sagra and Henares industrial corridors, as well as the provincial capitals, Talavera de la Reina and Puertollano (see Figure 5).
As far as we know, this is the firs time that this methodologies have been applied to mesure the depopulation risk, and, specifically, in the form of sDRI to classify the municipalities of Castilla-La Mancha.
Related to the social ans policy implications, we propose to include the scores of sDRI into an expert system capable of identifying the areas in which counter-measures can be applied by local and regional governments.